Imputation Methods for Handling Item- Nonresponse in the Social Sciences: A Methodological Review

نویسنده

  • Gabriele B. Durrant
چکیده

Missing data are often a problem in social science data. Imputation methods fill in the missing responses and lead, under certain conditions, to valid inference. This article reviews several imputation methods used in the social sciences and discusses advantages and disadvantages of these methods in practice. Simpler imputation methods as well as more advanced methods, such as fractional and multiple imputation, are considered. The paper introduces the reader new to the imputation literature to key ideas and methods. For those already familiar with imputation methods the paper highlights some new developments and clarifies some recent misconceptions in the use of imputation methods. The emphasis is on efficient hot deck imputation methods, implemented in either multiple or fractional imputation approaches. Software packages for using imputation methods in practice are reviewed highlighting newer developments. The paper discusses an example from the social sciences in detail, applying several imputation methods to a missing earnings variable. The objective is to illustrate how to choose between methods in a real data example. A simulation study evaluates various imputation methods, including predictive mean matching, fractional and multiple imputation. Certain forms of fractional and multiple hot deck methods are found to perform well with regards to bias and efficiency of a point estimator and robustness against model misspecifications. Standard parametric imputation methods are not found adequate for the application considered.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Imputation Methods for Handling Item-Nonresponse in Practice: Methodological Issues and Recent Debates

Nonresponse is a major problem often faced by social scientists when analysing survey data. A range of methods exists to impute the missing responses but the choice between these methods may be difficult. This article reviews advantages and disadvantages of a range of imputation methods and provides guidance on how to use such methods in practice. The paper introduces the reader new to the impu...

متن کامل

Missing data in Wave 2 of NSHAP: prevalence, predictors, and recommended treatment.

OBJECTIVES This report seeks to inform National Social Life, Health, and Aging Project (NSHAP) data users of the prevalence and predictors of missing data in the in-person interview (CAPI) and leave-behind questionnaire (LBQ) in Wave 2 of NSHAP, and methods to handle missingness. METHOD Missingness is quantified at the unit and item levels separately for CAPI and LBQ data, and at the item lev...

متن کامل

Estimating Variance of the Sample Mean in Two-phase Sampling with Unit Non-response Effect

In sample surveys, we always deal with two types of errors: Sampling error and non-sampling error. One of the most common non-sampling errors is nonresponse. This error happens when some sample units are not observed or viewed but they do not answer some of the questions. The complete prevention of this error is not possible, but it can be significantly reduced. The non-response causes bias and ...

متن کامل

1985: Compensating for Wave Nonresponse in the 1979 Isdp Research Panel

The choice between weighting adjustments and imputation for handling missing survey data is generally straightforward: as a rule, weighting adjustments are used for total nonresponse and imputation is used for item nonresponses. There are, however, several situations where the choice is debatable. In general, these are situations of what might be termed partial nonresponse, where some data are ...

متن کامل

A Simulation Study to Evaluate the Robustness of Recent Methods for Preparing Variance Estimates in the Presence of Hot Deck Imputation

Many large-scale surveys currently use a variety of single imputation methods–as discussed by Chapman (1976), Cox (1980; and Kalton and Kasprzyk (1986)—to handle item nonresponse. Since the use of such imputation increases the underlying variation in the survey results, methods are needed to assess the impact. Until fairly recently, methods to assess the impact of the imputation on the variance...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005